TYPE II DNA: when the interfacial energy becomes negative 
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An important step in transcription of a DNA base sequence to a protein is the initiation from the 
exact starting point, called promoter region. We propose a physical mechanism for identification of 
the promoter region, which relies on a new classification of DNAs into two types, Type-I and Type- 
II, like superconductors, depending on the sign of the energy of the interface separating the zipped 
and the unzipped phases. This is determined by the energies of helical ordering and stretching over 
two independent length scales. The negative interfacial energy in Type II DNA leads to domains of 
helically ordered state separated by defect regions, or blobs, enclosed by the interfaces. The defect 
blobs, pinned by non-coding promoter regions, would be physically distinct from all other types of 
bubbles. We also show that the order of the melting transition under a force is different for Type I 
and Type II. 



DNA in its double helical form shows a resilience 
against an external pulling force. The bound state does 
not allow a force g applied at an end to penetrate up 
to a critical force g — g c , above which the DNA gets 
unzipped [Tj-Q. The transition is first order for tempera- 
tures T < T c where T c is the denaturation(melting) tem- 
perature in the absence of any force Q . The force- induced 
unzipping transition of DNA is due to a competition be- 
tween the bond orientation by force and ordering by base 
pairing. The formation of a helically ordered dsDNA 
from denatured strands is a symmetry breaking transi- 
tion. At a coarse-grained level, the ordered state can be 
described by an order parameter ijj, with tp = for the de- 
natured state. The external force does not couple directly 
to this order parameter. Consequently, at a junction of 
a bound and an unzipped DNA, there is a need to define 
two length scales: one scale £ that gives the length over 
which the DNA ordering is damaged on the bound side of 
the interface, while the other scale A gives the distance 
over which the force penetrates the bound state. The 
existence of the second scale A was pointed out by de 
Gennes in a model involving stretching of the backbone 
and the hydrogen bonds Generally one expects in- 
terfaces separating phases to be energetically costly (e.g. 
surface tension), but here we show that if A 3> £, then the 
interfacial energy, or surface energy, between bound and 
unzipped DNA can become negative. There can then be 
a penetration of force in the form of distorted regions or 
"defect blobs" of length A enclosing a denatured bubble 
of size £. In analogy to superconductors, when the inter- 
facial energy becomes negative, one gets a mixed phase 
of DNA and the zipped-mixed phase transition becomes 
continuous. Based on the sign of the zipped-unzipped 
interfacial energy we classify DNA into two types: Type 
II has negative interfacial energy whereas Type I is the 
conventional case with positive interfacial energy. This 
classification is not related to the existing classification 
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based on DNA conformation. 

A Type II DNA has novel features which are of consid- 
erable biological and physical implications. To be noted 
that the defect blobs arc different from thermally cre- 
ated bubbles. This is because the bubbles of the lat- 
ter type would consist of random configurations of de- 
natured strands generated by thermal fluctuations and 
may have positive interfacial energy. The distinctness of 
the defect blobs can be a signature for their identifica- 
tion in biological processes. Let us consider the tran- 
scription process where the genetic code, determined by 
the base sequence, is transferred to the amino acid se- 
quence of a protein. For correct transcription, the se- 
quence must be read from the correct starting point on 
DNA. These starting non-coding regions are called pro- 
moter regions and their identification is the first and vi- 
tal step in transcription [6]. A pulling force or a forced 
separation in a homogeneous Type II DNA produces a fi- 
nite density of the defect blobs [3] (discussed later). The 
noncoding sequences or the promoter regions may act as 
inhomogeneities on a DNA and could play the role of 
pinning centers for the defect blobs. The advantage of 
physical identification of pinned defect blobs could facili- 
tate recognition of the promoter regions for gene expres- 
sion (e.g. see @, 0). So far as physical properties are 
concerned, Type I and Type II DNA will have different 
phase diagram and phase transition as discussed later. 

Recently, both in experiment [loj and simulation (Tlj. a 
continuous transition has been observed if the topology is 
preserved in a stretching experiment by pulling both the 
strands either at both ends or at one end of an anchored 
DNA. We also note that a detailed molecular dynamics 
study of under- or over- wound DNA without writhe, 
observed the formation of localized sequence-dependent 
defects which allow the rest of the dsDNA to be in the 
relaxed normal state. It is known that topoisomerase II 
may bind anywhere on the DNA but its topology chang- 
ing activity is restricted to specific sequences (cleavage 
sites) indicating that geometric distortions get localized 
around certain sequences fl3j|. These are consistent with 
our general predictions, though we like to add that inter- 
facial information in any of these cases are not available. 

The thermodynamic description of unzipping of DNA 



2 



requires three variables, ip describing the helical ordering 
(i.e., broken symmetry) and a force-displacement (g,x) 
conjugate pair, where x is the scaled separation between 
the two strands at the point of application of force g. 
On the bound side x can be taken as the response to an 
internal induced force g, so that, 



x{g) = X9, 



(1) 



where \ the stretchability or the response function, is 
independent of g in the linear response regime. Though 
we restrict to linear response regime here, the final results 
can be reproduced for a general force-dependent x- The 
variables are chosen such that ip = for the unzipped 
state, and ip ^ for the ordered state, while g — 
in the bulk of the ordered state. At this point it is to 
be noted that the order parameter ip represents helical 
ordering which is not directly coupled to the external 
pulling force. As a result we get two independent length 
scales in the problem. This makes the present treatment 
different from other existing models. 

For a homogeneous state, the Gibbs free energy G(T, g) 
per unit length at temperature T and a pulling force g is 
given by 



G(T,g)=G(T,0)-W(g), 



(2) 



where W(g) — J 9 x(g') dg' is the work for stretching. 
The conditions of phase coexistence at g = g c , i.e. 

G z (T, 5c ) =G u (T,g c ), (3) 

and non-penetration of force in the bound state for g < 



G z (T,g c ) = G u (T,g c ), 



(4) 



(subscripts z and u representing the zipped and the un- 
zipped phases), when combined with Eq. [51 give 



G Z (T, g) = G u (T, g) + W(g) - W(g c ). 



(5) 



Eq. \E\ agrees with the known exact results of Ref. [2[ 
when appropriate x{g) from the exact solution is used. 
In particular one verifies that G z — G u = \x{g 2 ~ 9c)> m 
the linear response regime (near melting). 

Compared to the stretched unzipped state, the zipped 
phase has to pay a cost W(g) for force expulsion for not 
following the force-diktat, but gains energy W{g c ) due 
to binding or ordering. The phase coexistence requires a 
perfect compensation of one by the other. This compen- 
sation may be used to obtain the binding energy of the 
zipped phase as 



E Z (T) = W(g c ). 



(6) 



This equation may also be used to define g c from the 
binding energy. 

Let us now consider an inhomogeneous situation of 
a dsDNA at T < T c by pulling at one end by a force 
g = g c {T) so that there is an interface separating the 



coexisting zipped and unzipped phases. The interfacial 
energy is obtained by comparing this mixed state free en- 
ergy with that of a fully unzipped homogeneous state at 
g = g c . Needless to say that an interface can be created 
spontaneously if there is a gain in energy in doing so. 

Since far from the interface, the Gibbs free energy den- 
sity is the same in the two phases, the total free energy 
Q can be written as 



G U (T, g c )dz + (j, 



(7) 



where a is the "surface tension" , and z is a contour length 
measured along the DNA or the strands, the z = point 
being chosen at the point of interface with z < as the 
unzipped phase. 

We start with the free energy functional 



dz F{ip, x}, 



(8) 



whose minimum gives the equilibrium free energy in a 
fixed distance ensemble. The functional F{ip, x} can be 
taken as 



T{i>,x) = F u +F{iP} 



dip 

dz 
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gix) dx. 



(9) 



where F{ifj} is the free energy of the homogeneous bulk 
zipped phase with reference to the unzipped state free 
energy F u . In the unzipped state F{ip} = 0. K$ and K x 
are additional "elastic" constants for distortions in i/j and 
x. The elastic part of the free energy can be extended 
to torques. The order parameter ip and force g are not 
coupled in the free energy in the form taken in Eq. |H] and 
consequently, this form is valid only in extreme limits. 
Further generalizations are not needed for this paper. 
The Gibbs free energy is obtained from Eq. [9] by using 
the equilibrium values of ip and x, followed by a Legendre 
transformation from x to g. 
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FIG. 1: Schematic diagram of variation of zipping-unzipping 
order parameter ip (continuous line) and applied force g 
(dashed line) inside unzipped and zipped phases. £ is the 
length scale of variation of ip inside the zipped phase and A is 
the scale for g. For Type I (left figure), k = A/£ <C 1 and for 
Type II (right figure), k>1. 
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The equilibrium conditions, obtained by minimizing 
Ftot, are 



SF K d 2 ip _ 

Sip dz 2 

„ d 2 x x 
-K x — + - = 0, 



(10) 
(11) 



with the condition that 

ip = 0, x = x c = X9c at z = 0, (12) 

and 



ip = ip , x = 
ipo being the solution of 



at 



oo, 



(13) 



SF 

to maximize the interfacial energy. The length scales £ 
and A, giving how fast ip or g grow or decay inside the 
zipped phase (see fig. [J), come from Eqs. [TU1 and [Til as 



r 



i 



1 dF 

ip dip 



and A 2 = K xX . (14) 



The equation for A reduces to the form derived by de 
Gennes [5| if the elastic constants of his model are used 
for K x and x- The dimensionless ratio k = A/£ is ex- 
pected to be different for different sequences of DNA. 

For c« 1, the external force penetrates only a short 
distance A into the zipped region. In contrast the order 
parameter rises to its asymptotic value ipo in a much 
larger length £. One has to pay the energy cost for the 
damage in ordering over a length scale £, and therefore, 



(15) 



which is positive. This is the conventional scenario 
of force expulsion of various models on the zipping- 
unzipping phase transition and this scenario gives the 
well-known behavior of the unzipping transition. 

When k»1, the force penetrates a greater distance A 
into the sample, so that there is an obvious gain in the 
stretching energy (i.e. reduction of the "positive energy" 
for force expulsion) over the interval of penetration, over 
and above the gain by ordering. With x from Eq. 111! 
Eq. gives 



X9 C 



A, 



(16) 



which is negative. Hence, it is possible to lower the free 
energy of the DNA by creating the interface. The value 
of k for transition from Type I to Type II depends on the 
form of x which, in turn, depends on the DNA sequence 
and the secondary structure. It is therefore primarily the 
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FIG. 2: Schematic diagram of a periodic array of defect blobs. 
The array has a periodicity d which controls the density of 
blobs. Each distorted region is of length ~ A with an unzipped 
core of size ~ £. 



sequence but also the secondary structure that determine 
whether a DNA would behave like Type I or II. 

If we now consider the bulk zipped state with a force 
g, then force penetration may be possible in the form 
of many isolated distorted regions or blobs. For A> ^, 
with the unzipped core of size £ costing an energy E z t;, 
and the x part of the free energy !F{ip,x} in Eq. [§1 one 
finds for a homogeneous chain that a periodic struc- 
ture of the blobs Q, as in fig. [3J is possible energeti- 
cally, if g > g c /y^K. The initial penetration of force is 
at g e i = g c / with periodicity d — > oo. The unzip- 
ping transition therefore becomes continuous in contrast 
to the first order nature for Type I. 

The negative interfacial energy is found in Type II 
superconductor [14j too. Our formulation is similar to 
that of Type II superconductivity in a one-dimensional 
geometry. As there is indeed a phase transition in DNA, 
the Landau theory is justified here. It suffices for a one 
dimensional case to consider a scalar order parameter. 

We may point out a few additional implications of a 
negative interfacial energy. The penetration of the force 
is not possible in the conventional polymer models. For 
any helical or twisted pairs of strings, a pulling force pro- 
duces over- winding. We expect this over- winding in DNA 
to be present at the interface, distorting but not vitiat- 
ing the ordered state. The resulting distortion plays a 
role in determining the interfacial energy. The penetra- 
tion of force is via a denatured core of size £, surrounded 
by such a distorted region of size A. These defect blobs 
could be pinned by certain sequences, thereby localiz- 
ing them in specific regions of the DNA. We speculate 
that the regions which localize the defect blobs are the 
non-coding promoter regions. This gives a topological 
interpretation of the defect blob and it would also be ap- 
plicable to torque. The existence of the mixed or Type 
II phase with pinned defect blobs will affect the melt- 
ing profile under a force, and the force-distance isotherm 
will show steps originating from the blobs, especially for 
finite chains. Our analysis shows that the relation be- 
tween ordering and unzipping is needed to get a negative 
interfacial energy. The helical ordering is not just base- 
pairing - it involves stacking and other distant neighbor 
interactions. Any microscopic model for Type II DNA 
would have to take these into account. On the exper- 
imental front, it is time for a second generation single 
molecular experiments that would explore the interfaces 
on DNA. 
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To summarize, in this paper we showed that differ- 
ent types of phenomena happen for two regimes of the 
ratio k = j of the independent length scales £ and A, 
of DNA order parameter (ip) and internal force (g) re- 
spectively. For k« 1, the interfacial energy is positive, 
and the unzipping or melting under a force is first order. 
The external force has no effect inside the ordered, or, 
zipped phase, i.e., there is no internal force (g) inside as 
A is small. This is named Type I. On the contrary, for 



k 1, the interfacial energy becomes negative and the 
force penetrates the zipped phase in the form of defect 
blobs. The creation of interfaces are energetically fa- 
vored, so that interfaces are formed spontaneously. Thus 
defect blobs are formed inside the ordered phase. Above 
a force threshold g > g c \ , there will be a finite density of 
these defect blobs. The melting under tension of unzip- 
ping is second order. This case is named Type II. 
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